Understanding Web Archiving Services and Their (Mis)Use on Social Media

نویسندگان

  • Savvas Zannettou
  • Jeremy Blackburn
  • Emiliano De Cristofaro
  • Michael Sirivianos
  • Gianluca Stringhini
چکیده

Web archiving services play an increasingly important role in today’s information ecosystem, by ensuring the continuing availability of information, or by deliberately caching content that might get deleted or removed. Among these, the Wayback Machine has been proactively archiving, since 2001, versions of a large number of Web pages, while newer services like archive.is allow users to create on-demand snapshots of specific Web pages, which serve as time capsules that can be shared across the Web. In this paper, we present a large-scale analysis of Web archiving services and their use on social media, shedding light on the actors involved in this ecosystem, the content that gets archived, and how it is shared. We crawl and study: 1) 21M URLs from archive.is, spanning almost two years; and 2) 356K archive.is plus 391K Wayback Machine URLs that were shared on four social networks: Reddit, Twitter, Gab, and 4chan’s Politically Incorrect board (/pol/) over 14 months. We observe that news and social media posts are the most common types of content archived, likely due to their perceived ephemeral and/or controversial nature. Moreover, URLs of archiving services are extensively shared on “fringe” communities within Reddit and 4chan to preserve possibly contentious content. Lastly, we find evidence of moderators nudging or even forcing users to use archives, instead of direct links, for news sources with opposing ideologies, potentially depriving them of ad revenue.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Entity-Based Opinion Mining from Text and Multimedia

Social web analysis is all about the users who are actively engaged and generate content. This content is dynamic, reflecting the societal and sentimental fluctuations of the authors as well as the ever-changing use of language. Social networks are pools of a wide range of articulation methods, from simple ”Like” buttons to complete articles, their content representing the diversity of opinions...

متن کامل

Social Media in Public Libraries: Recognition of Applications, Obstacles and Problems of Use

Background and Aim: Social media because of its interactive nature and the fact that it is   being free of charge is widely used in libraries. Web 2.0 is a tool that offers permanent connection every time and offers educational programs without limitations of place and time. But what is included in social media application in public libraries and what obstacles and problems are there in the way...

متن کامل

Stories From the Past Web

Archiving Web pages into themed collections is a method for ensuring these resources are available for posterity. Services such as Archive-It exists to allow institutions to develop, curate, and preserve collections of Web resources. Understanding the contents and boundaries of these archived collections is a challenge for most people, resulting in the paradox of the larger the collection, the ...

متن کامل

WADL 2016 Panels: Worldwide activities on Web archiving; Social media, Web archiving, and digital libraries

In addition to presentations based around scholarly papers, the Web Archiving and Digital Libraries 2016 workshop also featured two panel discussion sessions. Each panel centered on a theme and featured short presentations by the panelists, followed by a moderated discussion and interaction with workshop participants; the panels are described herein. 1. PANEL 1: WORLDWIDE ACTIVITIES ON WEB ARCH...

متن کامل

A Survey of Librarians' Perspectives on Marketing Library Services Using Social Media in Tehran, Iran, and Shahid Beheshti Universities of Medical Sciences

Background and Aim: The present study has examined librarians' views on the marketing of library services using social media as well as the applications, benefits, and challenges of their use in Tehran, Iran, and Shahid Beheshti Universities of Medical Sciences.  Materials and Methods: This research was a descriptive and applied survey and was conducted in 2019. The data collection tool was a ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1801.10396  شماره 

صفحات  -

تاریخ انتشار 2018